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DESCRIPTION 

PARALLEL MERGE /SORT PROCESSING DEVICE, METHOD, AND PROGRAM 

5 - TECHNICAL FIELD 

The technology to be a core of the present invention is a 
technology to sub-divide two sorted data string pairs into a 
plurality of sorted partial data string pairs while considering 

10 the magnitude of the key value so as to perform merge/sort 

processing at an arbitrary number of parallelism with inputting 
two sorted data strings. 

As an example of using this technology, the parallel two- 
branching merge/sort method will be described first- At first, 

15 an unsorted data string is divided by the number of available 
processors and the sort operation is performed in parallel, so 
as to generate the sorted partial data strings for the number of 
parallelism. Then the merge operation is performed using the 
two sets of sorted partial data strings as a pair, but in order 

20 to increase the processing speed, the input data string pair is 
sub-divided into the required number of data string pairs, and 
merge operation is performed in parallel respectively. For this 
output, the output area is specified by the parameters at 
execution of each merge operation, so that the result becomes 

25 the same as that when one merge operation is performed without 
sub-division. The present invention relates to a parallel 
merge/sort processing device, method, and program characterized 
in that this operation is repeated until a single sorted data 
string is received* 
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Data sorting is one of the basic processings using a 
computer . Sort processing and merge processing are operations 
for receiving one series of data strings by sorting the data 
strings which were input in ascending order (from small to 
large) or descending order (from large to small) on the 
information on the target field. In the merge operation, two or 
more series of data (s6rt target data or record) strings sorted 
in ascending order (or descending order) are received, and one 
series of data strings sorted in ascending order or descending 

order is output. 

The number of input series varies, as in 2, 3, 4, - - , but 
the case of 2 is particularly efficient since one unit of a data 
15 sort position can be defined by one comparison. 

Since the order of time required for merge processing is in 
proportion to the number of data (n) , the merge operation is 
advantageous compared with the sort operation, of which the 
order of required time is in proportion to n 2 or nlog 2 n. However, 
the required time increases as the data count increases, so a 
high-speed algorithm (calculation method) is demanded. The 
development of a high performance algorithm may expand the 
merge/sort application range. But this operation is simple, the 
comparison of two key values, so the possibility of finding 
factors to decrease time is low. The present invention would 
allow the parallelism of merge processing using two sorted 
partial data string pairs as input, which has been considered 
difficult in conventional technology- 
Decreasing processing time by performing merge processing 
on volumes of data simultaneously in parallel using a parallel 
processor has been proposed. However, in many cases a special 
topology is required for the connection network between 
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processors or a processor with special functions and structure 
is required, and this implementation is difficult for a general 
purpose computer system, such as a tightly-coupled multi- 
processor sharing a main storage* The present invention 
5 implements a merge/sort method/ which has high parallelism even 
when a general purpose processor, that has no special 
configuration or connection, is used, but if such a method is 
not used, processing efficiency becomes very low, as the 
following examples show. 

10 As an example of a parallel merge/sort method that can be 

applied to a general purpose parallel processor system, the 
parallel two-branching merge/sort calculation method will be 
described. Fig. 13 is a flow chart thereof/ and Fig. 14 is a 
diagram depicting the processing when the number of processors 

15 is eight. 

With reference to Fig. 13, the processing target unsorted 
data string (input data) is divided into p sets of data strings 
(where p = 2 q , q is an integer) , which corresponds to the number 
of processors (S100) . Using the p number of processors, the 

20 divided p sets of unsorted data strings are sorted independently 
and in parallel by. a quick sort method, for example (S101) . By 
performing q steps of merge processing for the p sets of sorted 
partial data strings (S102) , one set of sorted data strings can 
be finally acquired as a whole (S103) . 

25 With reference to Fig. 14 , processing when the number of 

processors is eight will be described. In Fig. 14, a circle 
indicates a processor, and a square indicates a data string or 
an area wnere a data string (D) is stored. The symbol in the 
circle indicates the content of processing to be executed by the 

30 processor, where S indicates sort processing, M indicates merge 
processing, and V indicates transfer processing to another 
storage area, which is executed when necessary. 
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The unsorted data strings in the input area are divided 
into eight partial data strings- The eight processors PI - P8 
execute sort processing of these partial data strings 
simultaneously in parallel. The processing results are stored 

in the areas Dll - D18 . 

Then the merge processing in the first step is executed. 
The data strings Dll and D12 are merged by the processor Pi and 
stored in the area D21, the data strings D13 and D14 are merged 
by the processor P3 and stored in the area D22, the data strings 
D1S and D16 are merged by the processor P5 and are stored in the 
area D23, and the data strings D17 and D18 are merged by the 
processor. P7 and stored in the area D24. In the first step, the 
processors P2, P4 and P8 are not used- (Here processors are 
assigned for the sake of convenience. This is the same for the 
15 description herein below.) 

Then the merge/sort in the second step is executed. The 
data strings D21 and D22 are merged by the processor Pi, and 
stored in the area D31, and the data strings D23 and D24 are 
merged by the processor PS and stored in the area D32. In the 
second step, the processors P2, P3, P4, P6, P7 and P8 are not 
used. 

Then the merge/sort in the third step is executed. The 
data strings D31 and D32 are merged by the processor PI and 
stored in the area D4. The merge/sort is now completed. In the 
third step, the processors P2, P3, P4, P5, P6, P7 and P8 are not 
used. The acquired result, D4, is transferred to the final 
output area by the eight processors if necessary. 

in the above-mentioned conventional method, processors 
which are not used increase as the merge processing steps 
advance, so the processing capability of the processors is 
wasted. In the case of the above example, only 50% of the 
processors are used in the merge processing in the first step, 
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and this processor utilization ratio is 25% in the merge 
processing in the second step, and 12.5% in the merge processing 
in the third step, thus the number of processors which are hot 
used increases as the processing steps advance. This is because 
5 the number of sorted partial data strings decreases to half each 
time the merge processing steps advance - 

Also the n umb er of data to be merged by one processor 
increases to double as the steps advance, so processing time 
increases . In the case of a system which gives priority to 

10 high-speed processing, this increase of processing time, due to 
the increase of the data volume to be handled by one processor, 
becomes a more serious problem than the problem of a drop in the 
utilization ratio of the processors. 

with solving the above problems in view, it is an object of 

15 the present invention to provide a parallel merge/sort 

processing device, method, and program which can increase the 
utilization efficiency of the processor in the merge/sort 
processing using parallel processors, and can decrease 
processing time. 

20 

DISCLOSURE OF THE INVENTION 

[Paragraph 0014] 

The present invention decreases the time required for merge 

25 processing by sub-dividing two sorted partial data string pairs, 
to be the input of the merge processing, into arbitrary data 
string pairs, and performing merge processing in parallel. 
Since a shared data group is referred to by a plurality of 
processors, the present invention is particularly effective for 

30 parallel processors where a plurality of processors are 

connected sharing a. main storage, but can be applied to any 
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parallel processors in an environment where. a data group can be 
referred to by each processor. 

in parallel two-branching merge/sort processing, an 
unsorted input data string is divided into a plurality of sets, 
and sort processing is executed for each set and sorted partial 
data strings are created. Hereafter, two sets of existing 
sorted partial data strings are paired and merge processing is 
performed to output one set of sorted partial data, and this 
processing step is repeated until the number of output data 
strings finally becomes one. In each merge processing step, 
merge processing is executed in parallel as long as there are a 
plurality of data string pairs to be input. 

parallel processing of merge processing, which is one unit 
job or task, is impossible with conventional technology, where 
parallelism decreases by half and processing time doubles as the 
merge processing steps advance. 

According to the present Invention, a pair of two sets of 
sorted partial data strings (sorted partial data strings 
acquired by the sort processing or merge processing shown in Fig. 
14) to be the input of the merge processing are sub-divided into 
a plurality of sorted partial data string pairs, so the 
processors for the number of sub-divisions can be used, and time 
required for the sort processing can be decreased. 

In order to increase parallelism, the following method can 
also be used. In other words, a method for starting merge 
operation from the side of which the key value is greater (start 
end) for two sets of sorted data string pairs to be input, U3ing 
two processors, and output ting the result in descending order 
from the start end of the output area (merging in descending 
order) , and a method for starting merge operation from the side 
of which the key value is smaller (termination end) and 
outputting the result in ascending order from the termination 
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end of the output area (merging in ascending order) , are 
executed simultaneously in parallel. 

Since this method imposes no restrictions, this method can 
be easily implemented in an environment where a plurality of 
tasks can operate in parallel. 

In the following description, only -sorting in descending 
order by one key" will be described for the sake of convenience. . 
"Sort" and "merge" are used for terms related to sorting. 

The parallel merge/sort processing device according to the 
present invention is basically comprised of a mechanism for 
dividing two series of input data string pairs into a plurality 
of sets of two series of input data string pairs used for the 
merge operation of a conventional method, and a mechanism for 
performing merge operation to these sets in parallel. 

How to assign the processor group to both mechanisms is 
arbitrary. In other words, a plurality of processors may 
execute the sub-division of the data string pair and then the 
processors are reassigned for the merge processing/ or a same 
processor may be in-charge of the sub-division operation of the 
20 data string and the merge operation thereafter. 

The output of the respective merge operation must have a 
same result as the case of executing one merge operation. For 
this, in each merge operation which operates in parallel, the 
result must be output to an area corresponding to the magnitude 
25 of the key value of the entire sub-divided input data string 
pairs. 

The processor group assigned to the partial data string 
pairs executes merge processing respectively, and also outputs 
the result of the merge processing to the pre-instructed area. 
For this, the pair at the first memory address of the area where 
the input data string pairs of the merge processing are stored, 
the number of data to be processed, and the first address of the 
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output area, are specified by the management processor, for 
example* 

Two processors are assigned to each one of the partial data 
string pairs, and the first processor executes merge processing 
in descending order from one end (side at which the key value is 
greater) of the partial data string,, and the second processor 
simultaneously executes merge processing in ascending order from 
the other end (side at which the key value is smaller) of the 
same partial data string, wherein the first processor writes the 
result of the merge processing sequentially from the one end of 
the partial data string, and the second processor writes the 
result of the merge processing sequentially from the other end 
of the part of this area* 
[Paragraph 0020] 

As described in paragraph 0014/ one data string pair is 
sub-divided into a plurality of data string pairs, but in the 
sub-division, the operation to divide into data strings in which 
a specified number x of data exists from the specified start 
point of the data string pair and the rest of the data strings 
can be repeated with changing x. 

Therefore the basis of the sub-division is dividing the 
data strings into two, and the; two sorted partial data string 
pairs acquired as a result must satisfy the following conditions - 

The sorted data string D comprised of n number of data is 
expressed as (D, n) , and a pair of two data strings is expressed 
as {(Di, n), <D 2 , n) } * 

When the pair of sorted data strings, { (Di, n) , (D 2 , n)}, is 
divided into two partial data strings { (Dn, n u ) , <D 2 i, n 2 i)}, and 
{ (D12, ni 2 ), (D 22 , n 22 )}, and these partial data strings are used 
for merge processing, the following formula must be established 
in order to output such that the magnitude of key values does 
not contradict the entire data strings. 
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The smaller one of the key values of the last part of the 

data in the partial data strings Du and Da > greater one of the 

key values of the first part of the data in the partial data 

strings D12 and D22- 

nn + n 21 - the number of data included in the significant 

data string pairs divided into 2 

. a parallel merge/sort processing method according to the 
present invention is a method for performing merge/sort 
processing using a parallel processor which includes a plurality 
of processors, where one unsorted data string, or two sorted 
partial data strings, or one of three or more sorted partial 
data strings and attribute information thereof (e.g. type, 
location information of data and number of data) are received as 
an input, comprising: 

a first step of dividing a data string and acquiring a plurality 
of unsorted partial data strings when the sorting target is an 
unsorted data string; 

a second step of assigning a processor to said plurality of 
partial data strings for sorting respectively; 
a third step of sorting each of said plurality of partial data 
strings independently by the assigned processor and outputting 
the sorted partial data strings in descending order; 
a fourth step of creating an input data string pair for 
merging/sorting using arbitrary two sorted partial data strings 
25 which were acquired in the third step or the seventh step or was 
input as initial data, and dividing the pair into a required 
number of sub-divided partial data string pairs respectively 
under a predetermined division condition; 

a fifth step of editing the job information required for merge 
30 processing on the sub-divided partial data string pairs ; 

a sixth step of assigning a processor to each merge processing; 
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a seventh step of performing merge processing by the assigned 
processor and outputting sorted partial data strings; and 
a step of repeating the fourth step to the seventh step until 
the number of output data strings becomes one. 

The above, however, is applied when one unsorted data 
string is provided, and the first step to the third step are 
unnecessary if two or more sorted data strings are provided. 

The repeat of the fourth step to the seventh step ends in 
the stage when the merge-processed sorted partial data strings 
are merged into one data string. If two sorted data strings are 
provided, the fourth to seventh steps are executed only once, 
and a repeat is not required - 

It is preferable that in the sixth step, two processors are 
assigned to the partial data string pair, and in the seventh 
step, the first processor performs merge processing in 
descending order from one end (side of which the key value is 
greater) in the partial data string, and the second processor 
performs merge processing in ascending order from the other end 
(side t>f which the key value is smaller) in the same partial 
data string simultaneously* 

(Deciding the position of the first part in the insignificant 
data string pair after two-division is performed) 

The two-division method for a data string pair shown in 
paragraph 0020 will be described below. 

This method comprises: 
• a step of setting x as an initial value of the index 
variables i and j for specifying the individual data in the data 
strings Di and D 2 (e.g. when the data string pairs, with 10,000 
data in each data string, are divided into 10 segments of data 
string pairs, with 1000 data in each data string, x * 1000 is 
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set as the number of data, and 1000,, . „ . 9000 is set for the 
initial index of division positions) ; 

• a comparison step of comparing a key value of data indicated 
by the index variable i of the data string Di and a key value of 

5 data indicated by the index variable j of the data string D 2 ; 

• a step of adding 1 to an index variable with a greater key 
value, subtracting 1 from an index value with a smaller key 
value, and then branching processing to the comparison step, if 
the key value of the data indicated by the index variable i of 

10 Di and the key value of the data indicated by the index variable 
j of T>2 are not the same in the initial comparison; 

• a step of adding 1 to the greater index variable, and 
subtracting 1 from the smaller index variable, and then 
branching processing to the comparison step, if the magnitude 

15 relationship of the key value of the data indicated by the index 
variable i of Di and the key value of the data indicated by the 
index variable j of D 2 is unchanged in the second or later 
comparison; 

• a step of regarding the data indicated by the index variable 
20 i and the data indicated by the index variable j as a division 

boundary respectively, if the key value of the data indicated by 
the index variable i of Di and the key value of the data 
indicated by the index variable j of D 2 are the same in the 
initial comparison; 

25 • a step of comparing the greater one of the key value of Di and 
the key value of D 2 in the previous comparison operation with 
the greater one of the key value of Di and the key value of D 2 in 
the current comparison operation and regarding the data with the 
smaller key value as the division boundary and the data 

30 initially compared with this data as the other boundary, if the 
magnitude relationship between the key value of the data 
indicated by the index variable i of T>i and the key value of the 
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data indicated by the index variable j of D 2 is inverted from 
the previous magnitude relationship (the case when the key 
values are the same is also regarded as an inversion in the 
second or later comparison) . 

The processing is used for determining the position of the 
first part of the latter half data string pair { (D i2 , ni 2 ), (D 22 , 
n 22 ) } after two-division is performed, in other words, for 
determining the start point of merging in descending order . 

(Deciding the position of the last part in the significant data 
string pair after two-division is performed) 

The two-division method shown in paragraph 0020 will be 
described below* 

This method comprises: 
• a step of setting x-1 as an initial value of the boundary 
index value for the index variable i and j for specifying 
individual data in the data strings Di and D2 (e.g. when the data 
string pairs, with 10,000 data in each data string, are divided 
into 10 segments of data string pairs, with 1000 data in each 
data string, 1000 is set for x; as the number of data, and 999, 
1999, • _ . 8999 is set for the initial index of division 
position) ; 

a comparison step of comparing a key value of data indicated 
by the index variable i of the data string Dl and a key value of 
data indicated by the index variable j of the data string D2; 

a step of adding 1 to an index variable of the data witn 
greater key value, subtracting 1 from an index variable of the 
data with smaller key value, and then branching processing to 
the comparison step, if the key value of the data indicated by 
the index variable i of D x and the key value of the data 
indicated by the index variable j of D 2 are not the same in the 
initial comparison; 
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a step of adding 1 to an index variable of the data with a 
greater key value, subtracting 1 from an index variable of the 
data with a smaller index value , and then branching processing 
to the comparison step, if the magnitude relationship of the key 
value of the data indicated by the index variable i of Di and 
the key value of the data indicated by the index variable j is 
unchanged in the second or later comparison; 

• a step of regarding the data indicated by the index variable 
i and the data indicated by the index variable j as a division 
boundary respectively, if the key value of the data indicated by 
the index i of Di and the key value of the data indicated by the 
index variable j are the same in the initial comparison; 

* a step of comparing the smaller one of the key value of D; 
and the key value of D2 in the previous comparison operation 
with the smaller one of the key value of Di and the key value of 
Dz in the current comparison operation, and regarding the data 
with a greater key value as the division boundary and regarding 
the data initially compared with this data as the other boundary, 
if the magnitude relationship between the key value of the data 
indicated by the index variable i of D x and the key value of the 
data indicated by the index variable j of D 2 is inverted from 

the previous magnitude relationship (the case when the key 
values are the same is also regarded as an inversion in the 
second or later comparison) . This processing is used for 
determining the position of the last part of the first half data 
string pair { (Dn, nu), (D 2l , n 21 J } after two-division is 
performed, in other words, for determining the start point of 
merging in ascending order. 

The present invention is a parallel merge/sort processing 
method for sub-dividing two sets of sorted data string pairs to 
be the input of merging/sorting into a plurality of data string 
pairs while considering the magnitude distribution of key values, 
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so as to enable a merge/sort processing at an arbitrary 
parallelism, comprising: a step of providing a first sorted data 
string and a second sorted data string; a step of sub-dividing 
the pair of the first data string and the second data string 
5 into a plurality of sorted partial data string pairs according 
to a required parallelism, considering the magnitude of the key 
values from the first part in both data strings; and a step of 
inputting the divided sorted partial data string pairs and 
merging them in parallel* 

10 The present invention is a program for performing parallel 

merge/sort processing using a parallel processor which includes 
a plurality of processors, where one unsorted data string or two 
sorted partial data strings,, or one of three or more sorted 
partial data strings and attribute information thereof (e*g. 

15 type, location information of data, number of data) are received 
as an input, comprising: 

a first step of dividing a data string and acquiring a plurality 
of unsorted partial data strings when the sorting target is an 
unsorted data string; 
20 a second step of assigning a processor to the plurality of 
partial data strings for sorting respectively; 

a third step of sorting each of the plurality of partial data 
strings independently by the assigned processor and outputting a 
sorted partial data string in -descending order; 

25 a fourth step of creating an input data string pair for 

merging/ sorting using arbitrary two sorted partial data strings 
which were acquired in the third step or the seventh step or was 
input as initial data, and dividing the pair into the required 
number of sub-divided partial data string pairs respectively 

30 under a predetermined condition; 

a fifth step of editing the job information required for merging 
the sub-divided partial data string pairs; 
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a sixth step of assigning processors to each merge processing; 
a seventh step of performing merge processing by the assigned 
processors and outputting the sorted partial data strings in 
descending order; and 

5 a step of repeating the fourth step to the seventh step using 
the merge-processed data strings as the partial data strings 
until the number of output data strings become one. 

The first step to the third step, however,, are necessary 
only when the input data string is unsorted. The repeat of the 

10 fourth step to the seventh step ends in the stage when the 

merge-processed sorted partial data strings are merged into one 
data string. If two sorted data strings are provided, the 
fourth step to the seventh step are executed only once, and a 
repeat is not required* 

15 The program according to the present invention is recorded 

on a recording medium/ for example. 

The media includes, for example, an EPRCM device, flash 
memory device, flexible disk, hard disk, magnetic tape, magneto- 
optical disk, CD (including CD-ROM, Video-CD), DVD (including 

20 DVD-Video, DVD-ROM and DVD-RAM) , ROM cartridge, RAM memory 

cartridge with backup batter y> flash memory cartridge and non- 
volatile RAM cartridge - 

Cable communication media, such as telephone lines, and 
radio communication media such as microwave lines, are also 

25 included. The Internet is included in the communication media 
referred to herein. 

A medium is something where information (primarily digital 
data and programs) is recorded by a physical means, and can have 
a processing device, such as a computer and dedicated processor, 

30 execute predetermined functions. In other words, a medium can 
be anything that can receive a downloaded program using a 



computer by some means, and .have the computer execute 
predetermined functions . 

BRIEF DESCRIPTION OF THE DRAWINGS 

Fig. 1 is a diagram depicting an overview of the parallel 
merge/sort processing device according to the present invention; 

Fig. 2 is a flow chart depicting the processing according 
to Embodiment 1; 

Fig. 3 is a diagram depicting the entire processing 
according to the embodiment; 

Fig. 4 is a diagram depicting the processing according to 
the embodiment ; 

Fig. 5 is a diagram depicting the processing according to 
the embodiment ; 

Fig. 6 is a diagram depicting the processing according to 
the embodiment ; 

Fig. 7 is a diagram depicting a part of the processing 
according to the embodiment; 

Fig. 8 is a flow chart depicting the area division 
according to the embodiment (merging in descending order) ; 

Fig. 9 is a flow chart depicting the area division 
according to the embodiment (merging in ascending order) ; 

Fig. 10 is a diagram depicting area division according to 
the embodiment; 

Fig. 11 is a diagram depicting an example of area division 
according to the embodiment ; 

Fig. 12 is a flow chart depicting the processing according 
to Embodiment 2; 

Fig. 13 is a flow chart depicting a conventional parallel 
merge/sort processing; and 
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Fig. 14 is a diagram depicting conventional merge/sort 
processing. 

BEST MODE FOR CARRYING POT THE INVENTION 

5 

Embodiment 1 

The device/method according to an embodiment of the present 
invention will be described with reference to the drawings . 

Fig. 1 shows a general configuration of this device. 1 is 

10 a parallel processor for merge processing that includes a 

plurality of processors and can execute a plurality of tasks 
simultaneously in parallel. 2 is a parallel processor for area 
division that divides two sets of sorted data storing pairs into 
a predetermined number of partial data string pairs and returns 

15 them to the processor for management 4. 3 is a parallel 

processor for sort processing that sorts unsorted data strings 
.existing in a specified area. 4 is a management processor which 
is in-charge of general control , and receives such information 6 
as a location of input data and the number of data, which is 

20 received from the outside at the start of the job, and performs 
such processing as editing information required for tasks to be 
executed by the parallel processor group to execute, starting 
the tasks and accepting task end reports. 5 is a data storage 
area which includes the sorting target data. 

25 a same parallel processor- may be used for 1 and 3 r which is 

the same for the parallel processor 2, for dividing a data 
string pair. 

Parallelism is improved by dividing two sorted data string 
pairs, which are handled only* for one merge processing in 
30 conventional technology, into a plurality of partial data string 
pairs. In the case of the later mentioned example in Fig. 3, a 
pair of data strings D21 and D22 to be input for merging are 
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divided into two, so as to acquire two sets of input data string 
pairs, which can be simultaneously merged in descending order 
and merged in ascending order respectively by the processors PI 
- P4, and the pair of data strings D31 and D32 is divided into 
four to acquire four sets of input data string pairs / which can 
be simultaneously merged in descending order and merged in 
ascending order respectively by the processors PI — P8 - This 
will be described in detail later. 

In the case of the parallel two-branch merge/sort method 
shown in the flow chart in Fig, 2, the sorting target data 
string is divided into p sets of unsorted data strings which 
have a relationship p = 2 q , and each data string is 
independently sorted by a quick-sort method, for example. 

SI - S3 are procedures for dividing the input data string 
into a plurality of data strings, and having P number of 
processors execute sort processing simultaneously in parallel. 
Then q steps of merge/sort are repeated, and finally one set of 
sorted data strings is acquired as a whole. 

SI: Divides the unsorted data string to be sorted into p 
sets of unsorted data strings. Here p « 2 q , which is the number 
of available processors, and q indicates the number of steps of 
merging, p is increased if a short processing time is desired, 
but in this case, q also increases because of the relationship 
of logap* The variable a, to indicate the merging step to be 
used, is initialized to 0 for convenience of explanation. 
S2: Assigns the p number of processors to the divided p 
segments . 

S3: Sorts the data strings independently. 

34: Divides the sorted partial data string pairs to be input 
for the merge operation. 
(1) a - a + 1 
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(2) 2**"* sets of input data string pairs are created by 
combining 2 q " a * 1 sets of sorted partial data strings two by two. 

Each respective input data string pair is divided into 2*" 1 
sets of sub-divided data string pairs. The method for dividing 
5 a data string pair will be described later. 

S5: Assigns processors for merging in descending order and 
for merging in ascending order to each of (2*-*) x (2* x ) — 2 q 1 
sets of sub-divided data string pairs. 

S6: Simultaneously executes merging in descending order and 
10 merging in ascending order to each set using two processors 
respectively, and acquires 2^" sets of sorted partial data 
strings. 

S7: Adds 1 to a if a q, and repeats the processing in S4 - S7 . 
S8 : By the above procedure, one set of sorted data strings can 

15 be finally acquired as a whole. 

In the method in Fig. 2, division is not performed in the 
first step, but two-division is performed for the area in the 
second step, four-division in the third step, eight-division in 
the fourth step, and 2^ division in the q-th step, and merging 

20 in descending order and merging in ascending order are applied 
to each area. In the case of the parallel two-branching 
merge/sort method, where the sorted data string division method 
in Fig. 2 is applied, a merge operation at parallelism p can 
always be performed. 

25 The processing of the parallel two-branching merge/ sort 

method when the number of processors is eight will be described 
with reference to Fig. 3. In Fig. 3, a circle indicates a 
processor, and a square indicates a data string or an area in 
which the data string (D) is stored. The symbol in the circle 

30 shows the content of the processing to be executed by the 
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processor, where S indicates sort processing, M indicates merge 
processing, and V indicates transfer processing. 

The data in the input area is divided into eight sets of 
unsorted data strings. The eight processors PI - P8 executes 
sort processing on these partial data strings simultaneously in 
parallel. The processing results are stored in the areas Dll - 
D18. 

Then the merging/sorting in the first step is performed - 
The data strings Dll and D12 are merged by the processors PI and 
P2 and stored in D21, the data strings D13 and D14 are merged by 
the processors P3 and P4 and stored in D22, the data strings D15 
and D16 are merged by the processors P5 and P6 and stored in D23, 
and the data strings D17 and D18 are merged by the processors P7 
and PB and stored in D24. 

In the first step, merging in descending order and merging 
in ascending order are performed simultaneously, so eight 
processors are used even if division is not executed. (The 
assignment of processors here is for the sake of convenience. 
This is the same herein below.) 

Then the merging/sorting in the second step is performed. 
The data string pairs are divided into two to use all the 
processors. In other words, the partial data string pair D21 
and D22 is divided into two while considering the magnitude of 
the key value, and a pair D21-1 and D22-1 and a pair D21-2 and 
D22-2 are created. This is the same for the data string pair 
D23 and D24. 

And the data strings D21-1 and D22-1 are merged by the 
processors PI and P2 and stored in one side of D31 (the side of 
which the key value is greater) , and the data strings D21-2 and 
D22-2 are merged by the processors P3 and P4 and stored in the 
other side of D31 (the side of which the key value is smaller) . 
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In the same way, the data strings D23 and D24 are merged by the 
processors P5 - P8 and stored in D32. 

Then the merging/sorting In the third step is performed. 
Division is executed in the same way as the second step. In 
other words, the data strings D31 and D32 are paired, which are 
divided into four while considering the magnitude of the key 
value r and the partial data string pairs D31-1 and D32-1, D31-2 
and D32-2, D31-3 and D32-3, and D31-4 and D32-4 are created 
respectively* The data strings D31-1 and D32-1 are merged by 
the processors PI and P2 and stored in a predetermined position 
in D4, the data strings D31-2 and D32-2 are merged by the 
processors P3 and P4 and stored in a predetermined position in 
the area D4, the data strings D31-3 and D32-3 are merged by the 
processors P5 and P6 and stored in a predetermined position in 
the area D4, and the data strings D31-4 and D32-4 are merged by 
the processors P7 and P8 and stored in a predetermined position 
in the area D4. By this, merging/sorting is completed. The 
acquired result D4 is transferred to the output area by the 
eight processors if necessary. 

In each step of the processing in Fig. 3, all of the eight 

processors are used. 

The processing in Fig. 3 will be described with reference 

to Fig- 4 to Fig. 6. 

The case of sorting a total of 3040 sets of data of eight 

partial data strings (number of data in each block =» 380) dlO - 

d!3 and d20 - d23 will be described. 

(1) dlO - d23 are sorted using eight processors 

respectively. The mid-stage in Fig. 4 shows the sorted status 

where the key values are arranged in descending order. In Fig. 

4, the triangle indicates the distribution of the magnitude of 

the key value in each block. 
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(2) A set of work areas elO - e!3 and e20 - e23 are 
created separately, and parallel merging is executed as the mid- 
stage in Fig. 4 shows. Then the result shown in the last stage 
is acquired. 

5 (3) The merging/ sorting in the second step is performed. 

{elO, ell) and (e20, e21) are combined and divided into two 
while considering the magnitude of the key value, so as to 
create two sets of data string pairs for merge input, and four 
parallel merge operations are executed (first stage in Fig. 5) * 

10 At the same time, (el2, el3) and (e22, e23) are combined and 
divided into two while considering the magnitude of the key 
value, so as to create two sets of data string pairs for merge 
input, and four parallel merge operations are executed (last 
stage in Fig. 5) - As a result, two series of four parallel 

15 mergings are executed in parallel. 

(4) The merging/sorting in the third step is performed - 
dlO - d!3 and d20 - d23 are combined and divided into four while 
considering the magnitude of the key value, so as to create four 
sets of merge input pairs, which are output to elO - e23. Each 

20 area is divided into 4, and processing in descending order and 
processing in ascending order are executed in each area, so 
eight processings are executed in parallel. 

Now the merge/sort processing in each step in Fig. 3 will 
be described in detail. Fig. 7 is a diagram depicting this. it 

25 is assumed that the input data string pairs for the input area 
(Di, n) and (D2, n) are provided. Here (Di, n) and (D2, n) mean 
the data strings Di and D2, which are comprised of n number of 
data respectively. In each area in Fig. 7, the data is sorted 
such that the key value is greater when approaching the left 

30 side and smaller when approaching the right side. 
[Paragraph 0048] 
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' It is assumed that the sorted partial data string pairs (Di f 
n) end (D 2 , n) are divided (two-division in the case of Fig. 7) 
into a pair (D lw nn) and (Dn, n 2X ) and into a pair (D 12f ni 2 ) and 
(D 22 , n«) by the data string pair division unit 2, shown in Fig. 
1, while considering the magnitude of the key value by the data 
str±ng pair division unit shown in Fig. 1. This division must 
satisfy the following formula. 

The smaller one of the key value of the data in the last 
part of D u and the key value of the data in the last part of D 2 i 
> greater one of the key value of the data in the first part of 
Di 2 and the key value of the data in the first part of D 22 . 
nn + n 2 i - 2x, niz + n 22 - 2 (n-x) 

Here 2x is a number of data comprising the data string pair 
( (Diu mi)/ (D 2 i, n 21 )}. 

The processor Pi performs merging in descending order and 
the processor P2 performs merging in ascending order for the 
data string pair (D u , nn) and (D 2 i, n 2l ) r wliich is input for the 
merge operation. 

The dotted line in Fig. 7 shows the sequence of merging. 
The result is sequentially output to (D31, n 3i ) , which is a part 
of the output area (D 3/ 2n) . 

In the same way, the processor P3 performs merging in 
descending order and the processor P4 performs merging in 
ascending order for the sorted data string pair <Di 2 , m 2 > and 
(D«, n«) . The result is sequentially output to (D 32 , n 32 ) , which 
is a part of the output area (D 3 , 2n) . 

Here (the key value of the data in the last part of D 3i ) 2s 
(the key value of the data in the first part of D 32 ) . 
n 3 t + n 32 = 2n 

By dividing the input data string pairs as above and 
assigning processors to the divided data string pairs 
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respectively, simultaneous parallel processing by a plurality of 
processors becomes possible. The above description is just an 
example/ and the present invention is not limited to this. 

Now a procedure to divide the sorted data string pairs will 
be described* As the example in Fig. 7 clearly shows, (the key 
value of the data in the last part of D3i) > (the key value of 
the data in the first part of D 3Z ) must be established for 
merging/ sorting to be correctly performed. D 3i is the 
merging/sorting result of D u and D 2i , and D 32 is the 
merging/sorting result of D12 and Da2. Therefore (the minimum 
value of the key values of the data included in Du or D 2X ) £ 
(the maximum value of the key values of the data included in D i2 
or D22) • This means that the condition of the area division of 
the present invention is as follows* 

In the division condition, when a pair of two pre-sorted 
data strings (Di, n) and (D 2 , n) is divided into two partial data 
string pairs, a pair (Du, nit) and (D 2i , n 2 i) and a pair (D 12 , n 12 ) 
and (D 22 , n 22 ) , (the smaller one of the key values of the data in 
the last part of the partial data string Du and that of the 
partial data string D 2i ) 2> (the greater value of the key values 
of the data in the first part of the partial data string D« and 
that of the partial data string D 22 ) must be established, and 
also nn + n 22 - 2x and m 2 + n 22 - 2n - 2x must be established. 
Here x is a half number of the number of data of the data string 
pairs when counted from the first part of Di and D 2 . 

For example, if two division operations, x = 300 and x = 
600, are performed for 1000 sorted data string pairs, these data 
string pairs are divided into three data string pairs , that is 
{(Du, an), (D2i, n 2 i)}, {(Di 2 , ni 2 ), (D 22 , n«)>, and { (D 13 , n 13 ) ; 
(D 23 „ n 23 )>, of which the total number of data in each data 
string pair is 600, 600 and 800 respectively. In this case, the 
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above-mentioned magnitude relationship must be satisfied between 
the key value of the 600 th data from the beginning when both data 
strings are combined {the smaller one of the key values of the 
data in the last part of D u and D*i) , and the key value of the 
601th data (the greater one of the key values of the data in the 
first part of D 12 and D a *) , and between the key value of the 
1200 th data and the key value of the 1201 st data. 

Now a sorted data string pair division method that 
satisfies the above condition will be described. Division of 
the data string pairs that satisfies the above condition can be 
implemented by the following division method for merging in 
descending order or by a division method for merging in 
ascending order, or by using both of these methods. 

Before describing a specific procedure of the sorted data 
string pair division method, a prerequisite thereof will be 
described. Two sorted data strings, D x and Ih, each of which is 
comprised of n number of data, are assumed, and both of the data 
strings are equally divided into k sets of data string pairs 
that include (n/k> sets of data. 

An individual data string is specified by a data string 
number, 0 to k-1. The position of each data is specified by an 
index value which increases 1 for each data with the first part 
of the data string 0. This position specification method is 
used not only for the data position specification of the entire 
Dl and D2, but also for the position specification in each 
divided data string. The start edge or the termination edge 
data position is specified by the expression -the first part" or 
-the last part" of the data string. 

in the. above-mentioned partial data strings D x and D 2 after 
equal division, the data strings with a same data string number 
are assumed as the temporary merging target data string pair. 
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In other words, the start point of merging in descending 
order, targeting the data string pair with the data string 
number i (i = 0 - k - 1) , is assumed to be the start edge of the 
remaining data string when (n/k) (i) sets of data is removed 
from the start edge of B x and D 2 , that is the point at which the 
index value is (n/k) (i) . 

The start point of merging in ascending order, targeting 
the data string with the data string number i (i « 0 - k - 1) , 
is assumed to be the last part of the data string where (n/k) (i 
+ 1) sets of data exist respectively from the start edge of Di 
and D 2 , that is the point at which the index value is (n/k) (i + 
1) - 1- 

The distribution of the key values in D x and in D 2 are 
independent from each other. Therefore even if data has been 

5 sorted in each data string, the sorted result, without any 

inconsistency in the distribution of the key values as a whole, 
cannot be acquired by simply performing a merge operation from 
the temporary start point. So the start point (division point) 
that satisfies the above condition will be determined by the 

> following procedure. 

<In the case of merging in descending order > 

There are two data strings, Di and D 2 , each of which 
consists of sorted n number of data. A data string pair which 
satisfies the condition of paragraph 0048 and the number of data 
from the first part of the data string is 2 (n/k) + 2, 4 (n/k) + 2, 
6 (n/k) 2(k-l) (n/k) + 2, and is determined with 

considering the magnitude of the key value from the first part 
of both data strings. This boundary is the index of the first 
part of the remaining data string pair when the data string pair, 
of which the number of data is 2 (n/k), 4 (n/k), 6 (n/k) . . . 2 (k-' 
l)(n/k), is removed. 
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The following refers to Fig, 8. 
S10: As the boundary initial value (or temporary boundary 
value) , a point with index value (n/k) , 2 (n/k) , 3 (n/k) , . . . 
(k-l)(n/k>, is selected. 

Sllr Sets the index value of the boundary initial value 
determined in S10 for the index variables for operation i and j 
in the Di and D2 areas. 

S12: Repeats the following procedure until the magnitude 
relationship is inverted. Including the initial comparison,, the 
case when the key values are the same is also regarded as an 
inversion. 

S12a: Compares the key values: of Di (i) and D 2 (j) . 
S12b: Judges the magnitude relationship. 

(1) Branches to S13a if both key values are the same in 
the initial comparison* 

(2) Branches to S13b if the magnitude relationship is 
inverted. 

(3) If both key values are not the same in the initial 
comparison, or if the magnitude relationship is not inverted in 
the second or later comparison, 1 is added to the index value 
with the greater key value, 1 is subtracted from the index value 
with the smaller key value, then processing branches to S12 
(Sl2c) • 

S13a: i and j are regarded as the indexes of the boundary point, 
and processing ends. 

SI 3b: Among comparison target data, data with a greater key 
value is compared immediately before and immediately after the 
change of the magnitude relationship, and the data with the 
smaller key value is regarded as boundary data. For the 
boundary data of the other area, data initially compared with 
this determined data is selected* 
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<ln the case of merging in ascending order> 

There are two data strings, Dx and D 2 , each of which 
contains n number of data. The last part of the data string 
which satisfies the condition of paragraph 0048 and the number 
of data from the first part of the data string is 2(n/k), 4 (n/k) , 
6(n/k), . * . 2(k-l) (n/k) is determined with considering the 
magnitude distribution of the key values of both data strings. 
Hereafter the following refers to Fig. 9. 

S20: As a boundary initial value (or a temporary boundary 
value), a point with index value (n/k)-l, 2(n/k)-l, 3(n/k)- 
1, - - - (k-l)(n/k)-l is selected. 

S21: Sets the index value of the boundary initial value 
determined in S20 for the index variables for operation i and j 
in the D x and D 2 areas. 

S22: Repeats the following procedure until the magnitude 
relationship is inverted. Including the initial comparison, the 
case when the key values are the same is also regarded as an 
inversion. 

S22a: Compares the key values of Di (±) and D* (j). 
S22b: Judges the magnitude relationship. 

(1) Branches to S23a if both key values are the same in 
the initial comparison. 

(2) Branches to S23b if the magnitude relationship is 
inverted. 

(3) If both key values are not the same in the initial 
comparison/ or if the magnitude relationship is not inverted in 
the second or later comparison, 1 is added to the index value 
with the greater key value, 1 is subtracted from the index value 
with the smaller key value, then processing branches to S22. 
S23a: Regards i and j as the indexes of the boundary point, and 
processing ends. 
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S23b: Among comparison target data, data with a smaller key 
value is compared immediately before and immediately after the 
change of the magnitude relationship , and the data with the 
greater key value is regarded as boundary data. For the 
boundary data o£ the other area, data initially compared with 
this determined data is selected - 

An example of dividing tne data string pairs according to 
the above procedure will be described. 

fig. 10 shows two data strings, including 10 numeric values 
respectively, which are sorted in descending order. Fig. 10 (a) 
is a diagram depicting the procedure to determine the start 
point of merging in descending order, Fig. 10 (b) is a diagram 
depicting the procedure to determine the start point of merging 
in ascending order, and Fig. 10 (c) shows the result of area 
division. 

Dl: (1948, 1868, 1862, 1148, 740, 620, 588, 256, 112, 100) 
D2: (1996, 1992, 1966, 1694, 1544, 1448, 1182, 432, 254, 30) 
[Paragraph 0059] 

<Deciding start point of merging in descending order>. 

(1) The first part of a data string, when five sets of 
data are excluded from the first part of the data string, 
becomes a temporal division point. This value is set for the 
indexes for operation i and j . It is assumed that i = j = 5 . 

(2) D x (i - 5) _ 620 and D 2 <j « 5) = 1466 are compared. 
Since 1466 is the greater, i becomes i - 1 = 4 and j becomes, j + 
1-6. 

(3) Dj (i = 4) - 740 and D 2 (j = 6) - 1182 are compared. 
Since 1182 is the greater, i becomes i - 1 = 3 and j becomes j + 
1 = 7. 

(4) Eh (I . 3) - 1148 and D 2 (j - 7) - 432 are compared. 
Since 1148 is the greater, the inversion of the magnitude 
relationship is recognized here. 



29 



(1) The greater value in each comparison before and after 
the inversion, that is, 1148 and 1182, are compared, and the 
point of the smaller one thereof, . that is 1148, is regarded as a 
boundary. The other boundary is the data initially compared 
with 1148, that is 432. If orie boundary is known, the other 
boundary index value may be determined using the following 
formula (1), that is 2 x 5 - 3 - 7. 

Index value of the other start point - index of temporary 
merging start point x 2 - index of the known start point 

<Deciding start point of merging in ascending order> 

(1) The point where five sets of data exist, from the 
first part of the data string, becomes a temporal dividing point 
This value is set for the indexes for operation i and j . It is 
assumed that i = j =» 4. 

(2) Di (i -p 4) - 740 and n (j « 4) - 1544 are compared, 
and since 1544 is the greater, i becomes i - 1 « 3 and j becomes 
3+1 = 5. 

(3) D L (i = 3} = 1148 and Di (j = 5) - 1446 are compared, 
and since 144 S is the greater, i becomes i - 1 « 2 and j becomes 
j + 1 - 6. 

(4) Di (i = 2) = 1862 and D* (j = 6) = 1182 are compared, 
and since 1862 is the greater, the inversion of the magnitude 
relationship is recognized here. 

(5) The smaller value in each comparison before an d after 
the inversion, that is 1148 and 1182, are compared, and the 
point of the greater one thereof, that is 1182, is regarded as a 
boundary. The other boundary is the data initially compared 
with 1182, that is 1862. The index value of the other boundary 
may be determined using the formula in paragraph 0059, that is 2 
X 4 - 6 - 2. 
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Fig. 11 (a) and (b) show another example. Fig. 11 (a) is a 
case when n - 12 # k = 4 and n/k = 3. Pig. 11 (b) is a case when 
n = 10, k * 5 and n/k = 2. As for this example, the above 
condition can be satisfied even if the output data, when 
potential data strings after division are merged, exists only in 
one of Dj. and D2 (do not extend over both data strings) . 

In the device/method according to the embodiment of the 
present invention, the number of divided segments can be 
arbitrarily determined according to the number of available 
processing devices. 

In the device/method according to the present invention, 
the effect is greater as the number of data is larger, since the 
ratio of the time required for dividing the sorted partial data 
string pair to be the input of merging/sorting into a plurality 
of data string pair, and the entire time is decreased. Among 
processing to divide the sorted partial data string pair to a 
plurality of data string pairs, if the processor assigned to the 
merge/sort operation executes the portion for determining 
substantial merging start point, then a more improved parallel 
processing becomes possible, and the ratio of the time required 
for area pair division to the entire processing time can be 
decreased. 

(1) Case when the effect is large 

When the distribution of the key values is almost the same 
for the data strings D x and D 2 , the magnitude relationship 
inverts in a short time. Therefore it does not take very much 
time to decide a data boundary to be the merging input. 

(2) Case when the division takes time 
The following cases are possible: 

(a) case when the decision of a division pair takes time 
When the segment number of the target data string pair is 
close to. k/2 at dividing into k segments, and the magnitude 
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relationship finally inverts at the first part or at the last 
part of Di or D2, that is near the edge of the data string, the 
inversion of the magnitude relationship takes time. This occurs 
when the key distribution is different between T>i and D z , but it 
5 is rare that this occurs with frequency. 

(b) Case when the key value areas do not overlap, and the 
casft when the magnitude relationship is not inverted. 
These are cases when comparison is continued seeking the 
inversion of the magnitude relationship , but inversion does not 

10 occur even if the front part of the end part of the area is 

reached* These cases can be handled as follows. If inversion 
does not occur even if the front part of the area is reached, 
inversion is forced by assuming an infinity key outside the 
front part, and if Inversion does not occur even if the end part 

15 of the area is reached, inversion is caused by assuming an- 
infinitesimal key outside the last part. 

In the above example, the number of data of the data string 
Di and that of D 2 are assumed to be the same. The present 
invention is not limited to this. The present invention can be 

20 applied also to the case when the number of data is different 

between Di and D 2 , only if the number of data is specified in the 
sort task or merge task. 

The present invention is a general purpose technology, and 
which is particularly suitable for general purpose parallel 

25 processor type computers sharing a main storage. However, the 
present invention will be able to be applied regardless the 
configuration and processing format of the system only if the 
sorting target data group can be commonly accessed. The present 
invention will be implemented as a part of programs integrated 

30 into the software (OS) of a parallel processor type computer. 

Embodiment 2 
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The device/method of Embodiment 1 uses merging in 
descending order and merging in ascending order simultaneously, 
as shown in Fig. 2. The present invention, however, can also be 
applied to the case when only one of the merging in descending 
order and merging in ascending order is used. Fig. 12 is a flow 
chart when the present invention is applied to the parallel 2 
branching merge/sort method* In this case, data is divided into 
two from the merging in the first step, so p number of 
processors always operate. The parallel processor to perform 
merge processing must have only one of the functions of merging 
in descending order or merging in ascending order, so 
implementation is easy. 

The processing content of Embodiment 2 is the same as the 
case of Embodiment 1, so description thereof will be omitted. 
The difference in Fig* 12 from Fig. 2 is S4b, S5b and S6b. 

The present Invention is not limited to the above 
embodiments, but can be modified in various ways within the 
scope of the invention stated in the Claims, which shall, 
needless to say, be included in the scope of the present 
invention . 

In the present description, '"means" does not always mean a 
physical means, but includes the case of when the function of 
each means is implemented by software. Also a function of one 
means may be implemented by two or more physical means, or two 
or more functions of means may be implemented by one physical 
means - 
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